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Abstract — We consider the downlink of a cellular network with 
multiple cells and multi-antenna base stations under arbitrary 
inter-cell cooperation, realistic distance-dependent pathloss, and 
general "fairness" requirements. Beyond Monte Carlo simulation, 
no efficient computation method to evaluate the ergodic through- 
put of such systems has been provided so far. We propose an 
analytic method based on the combination of the large random 
matrix theory with Lagrangian optimization. The proposed 
method is computationally much more efficient than Monte Carlo 
simulation and provides a very accurate approximation (almost 
indistinguishable) for the actual finite-dimensional case, even for 
of a small number of users and base station antennas. Numerical 
examples include linear 2-cell and planar three-sectored 7-cell 
layouts, with no inter-cell cooperation, sector cooperation, and 
full cooperation. 

I. Introduction 

Multiuser MIMO (MU-MIMO) technology is expected to 
play a key role in future wireless cellular networks. The MIMO 
Gaussian broadcast channel serves as an information theoretic 
model for the MU-MIMO downlink and its capacity was 
studied and characterized in |[l)-|(3). In a multi-cell scenario, 
depending on the level of base station (BS) cooperation, we 
are in the presence of a MIMO broadcast and interference 
channel problem, which is not yet fully understood in the 
information theoretic sense. In this work, we restrict our 
attention to the case where clusters of cooperating BSs act as 
a single distributed MIMO transmitter and where interference 
from other clusters of BSs is treated as noise. Our framework 
encompasses arbitrary cooperation clusters. 

Multi-cell systems under full and partial cooperation have 
been widely studied under the so-called "Wyner model" [4|- 
l?) which assumes a very simplified and rather unrealistic 
model in regard to the pathloss and makes the system es- 
sentially identical with respect to any given user As a matter 
of fact, users in different locations under the cellular coverage 
are subject to distance-dependent pathloss that may have more 
than 30 dB dynamic range [8 |. It follows that in these realistic 
propagation conditions the channels from a BS to users are 
not identically distributed at all. Furthermore, studying the 
downlink sum-capacity (or achievable sum-throughput, under 
some suboptimal scheme) is not a very meaningful approach 
from a system performance viewpoint: rate and power opti- 
mization would lead to the solution of letting a BS transmit 
only to users very close to the BS, while leaving users at the 
cell edges to starve. In order to avoid this problem, downlink 
scheduling has been widely studied, which achieves a good 



balance between network efficiency and "fairness" (see for 
example |[9[-pT) and refs. therein). The goal of fairness 
scheduling is to make the system operate at a rate point in 
its ergodic achievable rate region such that a suitable network 
utility function is maximized | [T2| . Taking into account realistic 
pathloss models, multi-cell systems with arbitrary inter-cell 
cooperation, and fairness requirements makes system analysis 
extremely complicated. In fact, so far, the system performance 
of such systems has been evaluated only through extensive and 
computationally very intensive Monte Carlo simulation. 

In this work, we consider a particular "large system limit" 
that is studied with known results from the large random 
matrix theory [13J-[15j and combine it with Lagrangian opti- 
mization in order to obtain a numerical "almost" closed-form 
analysis tool that incorporates all the above system aspects. 
The proposed method is much more efficient than Monte 
Carlo simulation and, somehow surprisingly, provides results 
that match very closely the performance of finite-dimensional 
systems, even for very small dimension. 

II. System setup 

Consider M BSs with -fN antennas each, and KN single- 
antenna user terminals distributed in the cellular coverage 
region. Users are divided into K co-located "user groups" 
of equal size N. Users in the same group are statistically 
equivalent: they experience the same pathloss from all BSs 
and their small-scale fading channel coefficients are iid. [^The 
received signal vector y^. ~ [yk,i • • • Vk.NV ^ for user 
group k is given by 

M 

Yk = ^ a,n,k'H.m,k^m + nk (1) 

m— 1 

where ara,k and lim,k denote the the distance depen- 
dent pathloss and jN x N small-scale channel fading ma- 
trix from BS m to user group k, respectively, = 
[xm.i ■ ■ ■ Xm.-yNV ^ C*^^ is the transmitted signal vector of 
BS m subject to the power constraint tr(Cov(Xm)) < P, 
and rife — [uk^i ■ ■ ■ Uk^Nf & denotes the AWGN at 
the user terminals. The elements of and of 'H.m,k are iid 
-CA/'(0,1). 

'in practice, co-located users are separated by a sufficient number of 
wavelength such that they undergo iid small-scale fading. However, the 
wavelength is sufficiently small so that they all have essentially the same 
distance-dependent pathloss. 



We define the BS partition {A^i, - - ,Ml} of set 
{I,- - ; M} and the corresponding user group partition 
{/Ci, • • • , ICl} of set {1, • • • , K}, where L is the number of 
cooperation clusters. Cooperation cluster i is formed by the 
BSs in A4e, acting effectively as a distributed network MIMO 
transmitter, and the user groups in ICi. We assume that the BSs 
in each cluster have perfect channel state information (CSI) for 
all users in the same cluster and statistical information (i.e. 
known distributions but not the instantaneous values) relative 
to channels from BSs in other clusters. The impact of non- 
perfect CSI at the transmitter and the required overhead for 
CSI estimation for inter-cell cooperation is addressed in y_6J. 
We restrict to the symmetric cellular structure for clusters 
such that each cluster has the same number of BSs and the 
same distribution of user groups with respect to all the BSs 
in the cluster Specific examples of such a structure will be 
given in Section |V] Under these assumption, owing to the 
symmetric nature of clusters, the sum transmit power of each 
BS is equal to P when we consider the cluster-wise sum- 
power constraint |A^f|P and it is satisfied with an equality. 
Then, the interference plus noise variance at any user terminal 
in group k E ICg is obtained as 



= 1 



P 



(2) 



and, from the viewpoint of cluster i, the system is equivalent 
to a single-cell MU-MIMO downlink given by 



yt = Hyx£ + ze 



(3) 



where £ C''^^'^ is the received signal vector formed by 
{yfe : k S ICl}, zg e (j^l'Cfl^^ ^ Gaussian interference plus 
noise vector with independent elements and block-diagonal 
covariance matrix E[z^Z£ ] = diag(tT|lAr : k E ICi), and E 



-r\Me\Nx\Ke\N 



is formed as 

^7ni ,ki H^^j^ ^/i;^ 



»^mi,fe|Kj,|Hmi,fc|Kj| 



(4) 



For the sake of system optimization, we introduce the "dual 
uplink" channel model Q, p7| , p8) corresponding to ([3]) 
given by 

r = Hs + w (5) 

where we define the transmit signal s E with diagonal 

covariance matrix Q subject to tr(Q) < Q, the AWGN vector 
w ^ CN{0,1bn), and the channel matrix H E ^^^'^^^ 
formed by blocks fcH,„ jt in the same way as in (j4i, and 
where, for notation simplicity, we let A = \JCi\, B = '-f\A4i\, 
Q =\Mi\P, and I3^,k = ^. 

We assume that the pathloss coefficients are fixed while 
the small-scale fading coefficients change in time accord- 
ing to some ergodic process with the assigned first-order 
iid CAf{0, 1) distribution. This is representative of a typical 
situation where the distance between BSs and users changes 



significantly over a time-scale of the order of tens of seconds, 
whereas the small-scale fading decorrelates completely within 
a few milliseconds [19 J . In this regime, the goal of downlink 
scheduling is to maximize a suitable strictly increasing and 
concave network utility function g{-) of ergodic user rates 
|i2| . By symmetry, all users in the same group should achieve 
the same ergodic rate. However, users in different groups may 
operate at different rates, depending on the network utility 
function g{-). For R — [Ri,R2,--- ,Ra] where Rk is the 
ergodic group rate given by the mean user rate of user group 
k, the fairness scheduling problem is formulated as 



maximize 
subject to 



.9(R) 

R e 7^c 



(6) 



where TZc^giQ) is the region of ergodic achievable group rates 
for system (jSll (equivalently, for cluster £ with channel model 
defined in ([3t) for given sum-power constraint Q. 

III. Weighted ergodic sum-rate maximization 

In this section, we recall the computation algorithm of | [T5| 
to solve, in the limit for — > oo, the weighted ergodic sum- 
rate maximization problem: 



maximize 



Y^WkRk 



subject to Re TZcigiQ). 



(7) 



This is a fundamental building block for the computation of 
(|6]l. The exposition in this section is necessarily brief and more 
details can be found in [15] and references therein. 



Let TT denote the permutation juch that W7rj^< 



and define 0^;^ = Hk-.AQk-.A'H.k-.A^ where H^:^ and Qk-.A 
denote the channel and the input covariance sub-matrices re- 
stricted to user groups 7rfc, . . . , tt^. By symmetry, we have that 
all users in the same group k are allocated the same (uplink) 
power Qk- After dividing the channel coefficients by y/N, 
the power constraint for any N is given by '^k=i Qk ^ Q- 
For the dual uplink channel the maximum weighted sum- 
rate is achieved by a block successive interference cancelation 
decoding strategy that jointly decodes users from group tti 
to tta, and, after decoding each group, subtracts it from the 
received signal. It follows that problem (j?]) reduces to the 
maximization with respect to Q of the objective function 



A 



Fw(Q) 



1 



^Afc-E[log|I+0,:^|] 

k=l 



(8) 



where W^^ - W^^_ 



with VFtto = 0. From the Karush- 



Kuhn-Tucker (KKT) conditions applied to the Lagrangian 
function £(Q,A) = -Fw(Q) — A(tr(Q) — Q), we can elim- 
inate the Lagrange multiplier A by imposing that the power 
constraint holds with equality and then we obtain 



(9) 





'l -E 




) 






) 



where mmse^ = itr(E[I- Q^.H^^ (I + 0,,^)-iH^J) for 
fc > i is the average Minimum Mean Square Error (MMSE) 
for the input symbols of group tt^ when the symbols of groups 
TTi, . . . , 7ri_i are cancelled from the observation r in (j5]l, and 
Htt^, denotes the BN x N submatrix of H corresponding to 

group TTk- 

For finite N, the amount of calculation in order to evaluate 



the solution of ^ is tremendous because E[mmse|.'^J must 
be computed by Monte Carlo simulation. In the following, 
we consider the large system regime where we let N oo 
and, by using the asymptotic random matrix theory of fT3l 
we arrive at a computationally efficient algorithm. Let T,[' 
lim 



(k) 



N- 



(k) 

mmse.-. i and define T 



(fc) _ 



- 1 and then a 

direct application of |l3| Lemma 1] yields F^*^ as the solution 
of the fixed-point equation 



.(fc) 
i:A 



32 



i+r 



(fc) 



(10) 



Combining (lOi with the iterative algorithm |20 Algorithm 
1] that converges to the solution of (|9]), we obtain Algorithm 
[T] below, (for notation simplicity, tt^ = fc is assumed for all 
k^l,...,A). 

Algorithm 1 Algorithm for weighted sum-rate maximization 



1) Initialize Q(0) = ^Ia- 



2) For £ = 0,1,2, 
settles: 

Qfc(^ + i) = g 



iterate until the following solution 



EtiELiA.(i-T|^W) 



(11) 



for fc — 1, . . . ,A, where T 



(fc) 



^{i) = 1/(1 + r,^^ 



.(fc) 



and r^.^(£) is obtained as the solution (also obtained 



3) Denote by t['^1{oo), Tl':J^{oo), and Qfe(oo) the fixed 
points reached by the iterations in step 2). If the condi- 
tion 

q^a^fSm < x:ea.(i-t(^m; 

i—1 j — 1 i—1 

is satisfied for all fc such that Qk{oo) ~ 0, then stop. 
Otherwise, set Qk — for fc corresponding to the lowest 
value of AiFj-.*^(oo) and repeat steps 2) and 3). 

After group powers = Qk{<x)) have been obtained from 
Algorithm [T] it remains to compute the ergodic group rates. 
We have 

R^, = [log |I + 0,:^|] - 1e [log |I + 0fc+l:^|] . (12) 

In the limit for N — > 00, we can use the asymp- 
totic analytical expression for the mutual information given 
in pTj . Adapting | |T4} Result 1] to our case, we obtain 
limjv-i-oo [log |I + ®fe:A|] from expression (21) in |15|. 



by iteration) of ( lOl for powers Qk = Qfc(^). 



-(fc) 



IV. Introducing fairness 

In a finite dimensional system, a dynamic scheduling policy 
allocates powers and rates in order to obtain the system ergodic 
rate point (i.e. the time-average user rates) as close to the 
solution of (|6]l as possible. This can be systematically obtained 
by the stochastic optimization approach of |10|, |11|, based 
on the idea of "virtual queues". For a deterministic network, 
we can exploit Lagrangian duality with outer subgradient 
iteration, where the Lagrange dual variables play the roles of 
the virtual queue backlogs of the dynamic scheduling. In the 
large system limit, the channel uncertainty disappears and the 
multi-cell MU-MIMO system becomes indeed deterministic. 
Hence, problem ([6]) for large N can be addressed directly, 
using Lagrangian duality. 

We rewrite (j6| using auxiliary variables r = [ri, • • • , ta] as 
follows: 



max 

r,Q,7r 
S.t. 



5(r) 



< 



1 



E 



log 



II 



f Qfc:A| 
0fe+l:A| 



Vfc, 



N 

tr(Q) < Q. 

The Lagrangian function of problem ( [T3] l is given by 
/:(r, Q,7r,/x) 



(13) 



= 5(r)- 



A 



1 



k=l k=l 



log 



k:A\ 



1 + 



k + l:A\ 



(14) 

where /j, denotes the dual variables for the rate constraints. 
The Lagrange dual function for ^T4\ is given by 

Gip) = max C{r,Q,Tr,fi) 

r,Q,7r 



max 

Q.rr 



(15) 



(a) (6) 

and it is obtained by the decoupled maximization in (a) (with 



respect to r) and in (b) (with respect to Q, tt) in ( 15 1. Finally, 
we solve the dual problem defined as 



mm 

fl>0 



(16) 



Since the maximization of (b) is a weighted ergodic sum-rate 
maximization for weights W = /x, it follows that the optimal 
TT* is the permutation that sorts fi in non-decreasing order (see 
Section IIIi. For tt — tt*, the Lagrangian function is concave 



in r and Q and convex in /i and the solution (saddle point of 
the min-max problem) can be found via inner-outer iterations. 
Inner Problem: For given fi, we solve the maximization 



problem in ( 15 1 with respect to r, Q and tt. 



• Subproblem (a): Since ,ffj,{r) is concave in r > 0, 
the maximum of fp,{v) is achieved by imposing the KKT 
conditions: 

drk 



fik < 0, Vfc 



(17) 



where the equaUty must hold for all k such that the solution 
is positive, i.e., > 0. 

• Subproblem (b): As said, this problem is a weighed 
ergodic sum-rate maximization and can be solved for — oo 
by Algorithm [T] 

Outer Problem: Once the optimal r*, Q* and tt* are 
obtained for given fi, the minimization of with respect 
to /X > can be performed by a subgradient-based method. 
For any fixed fi' and fi, we have 

Giti') = max /;^,(r) +max h^,iQ,n*) 



>ft,'{r*) + h'^,{Q*,7:*) 

A 



fc=i 



(18) 



where R-^^ is the rate of user group tt^ obtained from Algo- 
rithm [T] with weights fj,. Then, the vector with components 
R^t — Ttt* is the subgradient for Gifi) and dual variable /x^^ 
is updated at each outer iteration n as 

+ = /i^*(n)-s(n) (i?^.(7i) - r^*(n)) , V /c (19) 

for some step size s{n) > which can be determined effi- 
ciently by a back-tracking line search method |(2T|. It should be 
noticed that by setting s{n) = 1 this subgradient update plays 
the role of the virtual queue update in the dynamic scheduling 
policy of |[T0|, |[rT|. 

As an application example of the above general optimiza- 
tion, we focus on the two special cases, proportional fairness 
scheduling (PFS) and hard fairness scheduling (HPS), also 
known as max-min fairness scheduling. PFS corresponds to 
the network utility function ^(r) = J2k=i C'log(7'fc) for some 
constant C > and the optimality condition (17i yields 
fkin) — C / Hkin) at outer iteration n. 

In the case of HFS, the network utility function is given as 
5(r) = minfe=i ... Crfc and the corresponding optimization 
problem can be rewritten as 



max Cr 

r,Q,7r 



S.t. 



tr(Q) < g. 



log 



\l+®k:A\ 

1 + 



k+l:A\ 



Vfc 



(20) 



Replicating the former approach for this problem, with the 
corresponding changes, we find r{n) — \ Ylik=i-^-^k{'^) 
outer iteration n. 

V. Numerical results and discussion 

We present numerical examples for a one-dimensional 2- 
cell model (M — 2) and two-dimensional three-sectored 7- 
cell model (M ~ 21). In both models, the system parameters 
and pathloss model are based on the mobile WiMAX system 
evaluation specification [8|, except for cell radius 1.0 km and 
no shadow fading assumption. In the 2-cell model, we assume 
two one-sided cells with BSs located at -1 km and 1 km with 
7 = 4, and K = % user groups equally spaced between the 
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Fig. 1. Individual group rates under (a) PFS and (b) HFS for 7 
= 8 in the 2-cell model 



4 and 



two BSs. We consider the case of full BS cooperation and 
no cooperation with a symmetric partition of user groups, 
namely: L = 2, /Ci = {1,2,3,4} and /C2 = {5,6,7,8}. 
Fig. [T] illustrates the individual group rates (i.e. mean user 
rates of each group) in the large system limit as a function of 
group locations and compares them with the achievable rates 
obtained by Monte Carlo simulation in finite dimensions with 
TV = 1, 2, or 4. In the simulation with finite N , the BSs are 
equipped with antennas and N users are located at each 
of K locations. Channel vectors are randomly generated and 
the dynamic scheduling fTTl and DPC precoding with water- 
filling algorithm |22 ) are applied to each realization of channel 
vectors. In plots (a) and (b), the PFS and HFS are considered, 
respectively. Remarkably, the rates obtained by the large 
system analysis almost overlap with those obtained by the 
finite-dimensional simulations, even for = 1. Notice that the 
dynamic scheduling policy should provide multiuser diversity 
gain and achieve in general higher rates than the large system 
limit which is not able to exploit the dynamic fluctuations 
of the small-scale fading due to "channel hardening" effect. 
However, it appears that in the regime where the pathloss is 
dominant over the randomness of multi-antenna channels and 
the number of users is not much larger than the number of 
BS antennas, the multiuser diversity gain is negligible and 
the asymptotic analysis produces rate points very close to the 
outputs of simulations with dynamic scheduling. Notice that in 
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Fig. 2. Ergodic group rate distribution under PFS in the 7-cell model 



HFS case all the users achieve the same individual rate which 
is slightly higher than the smallest rate under the PFS. 

Using the proposed asymptotic analysis, validated with the 
simple 2-cell model, we can obtain ergodic rate distributions 
for much larger systems, for which a full-scale simulation 
would be very demanding. We consider a two-dimensional 
cell layout where 7 hexagonal cells form a network and each 
cell consists of three 120-degree sectors. Three BSs are co- 
located at the center of each cell such that each BS handles 
one sector in no cooperation case. Each sector is split into 4 
diamond-shaped equal-area grids and one user group is placed 
at the center of each grid. Therefore there are total M = 21 
BSs and X = 84 user groups in the network. In addition, we 
assume the 7 cells are wrap-around in a torus topology, such 
that each cell is virtually surrounded by the other 6 cells and all 
the cells have the symmetric inter-cell interference distribution. 



The antenna orientation and pattern follows 1 23 1 and the non- 
ideal antenna pattern generates inter-cell interference even 
between co-located sectors with no cooperation. This model 
indeed conjectures very large scale of cellular networks in 
that the dominant inter-cell interference and effective inter- 
cell cooperation gain comes from directly neighboring cells. 
Fig. |2] shows the user rate distribution under three levels of 
cooperation, (a) no cooperation (L ~ 21), (b) cooperation 
among co-located 3 sectors (L = 7), and (c) full cooperation 
over 7-cell network (L — 1). The simulation is very intricate 
especially in case (c). From the asymptotic rate results, it is 



shown that in case (b), the cooperation gain over case (a) is 
primarily obtained for users around cell centers, whereas the 
gain is achieved over all the locations in case (c). 
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